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METHOD FOR STREAM MERGING 



Field of the Invention 

The present invention relates generally to media streaming, and, 
5 more particularly, to optimizing multicast delivery of a media stream to a plurality 
of clients in a communication network. 



Background of the Invention 

The simplest policy for serving a media stream to a plurality of 

10 clients - e.g., in a video or audio-on-demand system - is to allocate a new media 
stream for each client request. Such a policy, however, is very expensive, as it 
requires server bandwidth that scales linearly with the number of clients. One of 
the most common techniques for reducing server bandwidth is to "batch" 
multicasted streams into scheduled intervals. Clients that arrive in an interval are 

15 satisfied by a full stream at the end of the interval. Bandwidth is saved at the 
expense of longer guaranteed startup delay. 

One recent proposal to reduce server bandwidth is to use a server 
delivery policy referred to as "stream merging." See, e.g., D. L. Eager, M. K. 
Vernon, and J. Zahorjan, "Minimizing bandwidth requirements for on-demand 

20 data delivery," Proceedings of the 5* hitemational Workshop on Advances in 
Multimedia Liformation Systems (MIS '99), 80-87, 1999. Stream merging 
assumes that there are multicast media channels and that each client has adequate 
buffer space and receive bandwidth that is at least twice the playback bandwidth. 
Under stream merging, the client receives two (or more) channels of the media 

25 stream: one channel starting from the beginning of the stream, a second channel 
commencing mid-stream, e.g. as it is being multicast to other clients who have 
arrived at an earlier time. The client commences processing of the first channel 
while buffering the second channel. When the first channel reaches the point in 
the stream corresponding to the beginning of the buffered stream from the second 

30 channel, the client switches to the buffered stream (thereby "merging" the 
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Streams) and the transmission on the first channel may be dropped - thereby 
saving bandwidth. 

Summary of the Invention 

35 The present invention is directed to a system and method for 

stream merging which improves upon the prior art by utilizing optimized merging 
patterns, hi accordance with an embodiment of the present invention, the server, 
channels, and clients in the stream merging architecture have specific and well- 
defined roles. The server informs the client which streams to monitor and for how 

40 long; the server advantageously need only communicate with the client during 

setup of the media stream. In accordance with another embodiment of the present 
invention, the server optimizes the merging of multiple client streams by 
minimizing the cost of different merge patterns. Optimal solutions are disclosed 
for when stream initiations are both known and unknown ahead of time. Where 

45 streams initiations are regular and known ahead of time, optimal merging patterns 
can be calculated using a novel closed form solution for the merge cost. Where 
the stream initiations are not regular, the server can utilize the property of 
monotonicity to quickly calculate optimal merge patterns. Where stream 
initiations are not known ahead of time, the server can readily decide whether to 

50 initiate a new stream or whether to merge the new stream into the existing merge 
tree, advantageously into the right frontier of the merge tree. The inventors 
disclose that optimal merge trees have interesting relationships to Fibonacci 
number recurrences and that a Fibonacci merge tree structure can be 
advantageously used in an on-line stream merging system. 

55 These and other advantages of the invention will be apparent to 

those of ordinary skill in the art by reference to the following detailed description 
and the accompanying drawings. 

Brief Description of the Drawings 

60 FIG. 1 illustrates a multicast network with a server and multiple 

clients. 



2 



Docket No. 2000-0218 



FIG. 2 is a representation of the components of the server and 
cHents in FIG. 1. 

FIG. 3 is a timehne illustrating the process of stream merging. 
65 FIG. 4 is a flowchart of processing performed by a client, in 

accordance with an embodiment of the invention. 

FIG. 5 is a flowchart of processing performed by a server using 
off-line stream merging, in accordance with an embodiment of the invention. 
FIG. 6 is a conceptual representation of a merge tree, 
70 corresponding to the stream merging example shown in FIG. 3. 

FIG. 7 is an abstract diagram illustrating the recursive structure of 
a merge tree T with root r. 

FIG. 8A through 8D are conceptual representations of Fibonacci 
merge trees for n = 3, 5, 8, 13, respectively. FIG. 8E and 8F illustrate two merge 
75 trees for n = 4. 

FIG. 9 shows the values of /(n) for 2 < < 34. 
FIG. 10 is a flowchart of processing performed by a server using 
on-line stream merging, in accordance with an embodiment of the invention. 

Fig. 1 1 is an abstract diagram illustrating the transformation from 
80 r„_i to rj_, in the basic merging rule in on-line stream merging. 

Fig. 12 is an abstract diagram illustrating the transformation from T 

tor. 

Fig. 13 is an abstract diagram illustrating the transformation from 
r„_i to r„'_i in a dynamic tree algorithm. 

85 

Detailed Description 

In FIG. 1, a plurahty of media clients 110, 120, ... 130 are 
provided access to media streams by a multicast-enabled network 100, as is well 
understood in the art. A media server 150 stores and multicasts particularly 
90 popular media on multiple channels 101 at different times across network 100 to 
satisfy client demands. Each client, e.g. client 110, issues a request to the server 
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150 for a media stream, otherwise referred to herein as an "arrival" at the server 
150. At each arrival time, a stream is scheduled by the server 150, although for a 
given arrival the stream may not run until conclusion because only an initial 

95 segment of the stream is needed by the client 1 10. The server 150 issues a 

response to the client 101 that informs the client 101 which streams to monitor 
and for how long. The request and the corresponding response can be made using 
any known or arbitrary communication protocol. After this exchange, the client 
101 needs no further interaction with the server 150. The client 1 10 receives and 

100 buffers data from two or more streams at the same time, in accordance with the 
response from the server 150, while a user can "play" or "view" the data 
accumulated in the client buffer. Each client 1 10. .. 130 can receive all the parts of 
the media stream and play them without any interruption starting right after the 
time of its arrival. 

105 FIG. 2 is a conceptual diagram of the components of the client 210 

and server 250, corresponding to the client 1 10 and server 150 in FIG. 1. The 
server 250 comprises a computational engine 251, which constructs optimized 
stream merging patterns, as further described herein, connected to an external or 
internal storage device 252 which may be used for the storage of the media 

1 10 stream(s). The computational engine 25 1 controls a network interface 255 which 
forwards the relevant media streams at the relevant times through the multicast 
network 100. Although the exact number of channels is not relevant to the 
invention, four multicast channels 201, 202, 203, 204 are shown in FIG. 2. The 
client 210 has its own network interface 215 capable of receiving data from the 

115 multicast channels 201-204. The client 210 has its own computational engine 

211, which merely follows the stream merging rales and the receiving procedure 
described below. The client's computational engine 211 directs and stores data 
received from particular multicast channels to a memory buffer 212. The client 
210 can have a player component 213, which is capable of presenting the data in 

120 the media stream to a user. At the top of FIG. 2, the client 210 is depicted 
commencing the processing of a media stream, after obtaining a receiving 
procedure from the server 250. As shown in the bottom of FIG. 2, the client 210 
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buffers the data received from two multicast channels while simultaneously 
sending the initial parts of the stream to the player component 213. The exact 

125 nature of the processing performed by the client 210 and the server 250 is now 
described in further detail. 

For purposes of describing the different embodiments of the 
invention, it is advantageous to use a discrete time model, as illustrated by the 
timeline shown in FIG. 3. The horizontal axis is the time axis and the vertical axis 

130 shows the particular unit of the full stream that is transmitted. Time is assumed to 
be slotted into unit sized intervals, each slot / starting at time t-l where the 
length of a full stream is L units. Let ,f2,— be a sequence of arrival times for 
clients. Clients that arrive at the same time slot can be considered as one client 
and serviced in the same manner. At each arrival time, a new stream is initiated - 

135 although for a given arrival the stream may be truncated in the context of the 

stream merging process. The client arrival time is used herein interchangeably to 
identify the client(s) arrival and the stream initiated at the particular time. The 
time interval can be a reflection of the delay constraints of the media streaming 
system: e.g. a two hour streaming movie which can tolerate a 4 minute startup 

140 delay can be configured with a time interval of 4 minutes making each movie L = 
30 units long. Note that although the invention is presented in the context of a 
discrete time model, it is readily extendible to a non-discrete time model by letting 
the time slots be as small as desired and where the value of L is as large as needed. 
FIG. 3 shows a full stream of some length L > 24 commenced at time slot 1 with a 

145 series of other streams commenced at later times and truncated and merged into 
the full stream. 

FIG. 4 is a flowchart of the processing performed by a media client 
1 10, in accordance with a preferred embodiment of the invention. At step 401, the 
media client 110 issues a request for a media stream to the media server 150. As 
150 described in further detail below, the server 150 constructs a stream merging 

pattern and, at step 402, retums a schedule of arrival times for a plurality of k + I 
streams denoted as xo, xi, ... , Xk and referred to herein as a receiving procedure 
for the client. Thereafter, the client 110 needs no further communication with the 
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media server 150 and, at steps 403 to 408, can merely "listen" to the identified 

155 multicast channel at the particular associated time periods represented in the 

receiving procedure. At step 403, the counter i is set to 0. From time slot Xk until 
time slot 2xk-Xk-i, the client receives different parts of the requested media 
stream from two different multicast channels. At 405, the client receives the 
beginning of the requested stream, namely parts 1 , ...,Xk-Xk-u from stream Xk 

160 and can immediately begin utilizing the stream. Simultaneously at 406, the client 
buffers the parts x^t - -^i - 1 + 1 , • • • , 2^^ - 2jC;t _ i from stream Xk-i. At step 407, 
the counter i is incremented by 1 and the steps 404 to 406 are repeated until i 
equals ^ - 1 . From time slot 2xk -Xk-i until time slot 2xk - _ i, parts 
2xk - 2xk ... ,2xk-Xk-i-Xk-i-\ are received from stream xt - vi^hile parts 

165 2xk-Xk-i-Xk-i-i + 1, ... , 2xk-2xk-i-i are received from stream Xk-i-i. The 
parts are buffered and played as needed. Finally, at step 408, the last parts of the 
media stream 2{xk- xq) + I, ... ,L are received and buffered from stream xq from 
time slot 2xk - xq until time slot xq + L. Note that although FIG. 4 illustrates the 
invention with two multicast receiving streams, the invention is not limited to the 

170 "receive-two" model shown and described herein. One of ordinary skill in the art 
can readily extend the embodiment to multiple multicast receiving streams, 
although it can be shown that the benefits of adding receiving bandwidth become 
marginal. 

The media client 110 advantageously avoids complex computa- 
175 tions and need only follows the basic stream merging rales reflected in FIG. 4. As 
an example of the processing performed in FIG. 4, consider the stream merging 
pattern set forth in FIG. 3. A full stream of length L has already been conamenced 
at time slot 1. The client, upon issuing a media stream request just before time 
slot 13, is issued a receiving procedure of streams xq = 1, xi = 9, X2 = 12, X3, = 13, 
180 where k = 3. At time slot 13 (where the counter / = 0 in FIG. 4), the client 

receives the first part of the stream from stream X3 while buffering part 2 from 
stream X2. For the next three time slots (i= 1), the client receives and buffers parts 
3-5 from stream X2 while receiving and buffering parts 6-8 from stream xi. For 
the next eight time slots (i = 2), the client receives and buffers parts 9-16 from 
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185 Stream xi while receiving and buffering parts 17-24 from stream xq. Finally at the 
last step in FIG. 4, the client receives the remaining parts of the stream from the 
full stream xq. 

The media server 150 is responsible for computing the stream 
merging patterns and for disseminating the proper receiving procedures to its 

190 clients. FIG. 5 sets forth a simplified flowchart of the processing performed by 
the server 150 in the "off-line" situation, i.e. where the stream initiations are 
known ahead of time. At step 501, the server 150 receives reservation requests in 
advance from each of the clients. At step 502, the server 150 calculates an 
optimal merging schedule with corresponding receiving procedures xo, ... ,Xk for 

195 each client, and, at step 503, the server 150 transmits the receiving procedures to 
each client. At step 504, the server 150 commences the multicast transmissions, 
in accordance with the schedule calculated at step 502. 

A preferred method of calculating the merging schedule would be 
to optimize the "cost" of different merging patterns. For example, the server 

200 could minimize the sum of the lengths of all of the streams in the merging pattern, 
which would be equivalent to minimizing the total number of units (total 
bandwidth) needed to serve all the clients. In that context, and in accordance with 
an aspect of the invention, FIG. 6 illustrates a particularly helpful abstraction of 
the diagram set forth in FIG. 3. The inventors refer to the abstraction as a "merge 

205 tree." A merge tree is an ordered labeled tree, where each node 601-608 is labeled 
with an arrival time and the stream initiated at that time. For example, nodes 601, 
602, 603 and 604 correspond to the arrival times/streams xo = 1, xi = 9, X2 = 12, X3 
= 13, described above. Each new stream can only merge to an earlier stream, and 
the children of a given node are ordered by their arrival times. If a preordered 

210 traversal of the labeled tree yields the arrival times in order, as does the tree 

illustrated in FIG. 6, the tree is said to have a "preorder traversal property." An 
optimized solution for a given client arrival sequence is a merging pattern which 
can be represented as a sequence of merge trees, which the inventors refer to as a 
"merge forest." 
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215 Given a merge tree T, the root of the tree represents a full stream of 

length L and is denoted by r(Z). If x is a node in the merge tree, (.{%) is defined as 
the length in T; that is, l{x) is the minimum length needed to guarantee that all 
the clients can receive their data from stream x using the stream merging rules. A 
helpful distinction can be made between "merge cost" and "full cost" where the 

220 merge cost includes just the cost of merging and not the full stream which is the 
target of the merging. The merge cost is defined as 

Mcost(7)= 

That is, the merge cost of a tree is the sum of all lengths in the tree except the 
length of the root of the tree. The full cost counts everything: merging cost and 

225 full stream cost for all of the merge trees in the forest. The optimal merge cost is 
defined as the minimum cost of any merge tree for the sequence. An optimal 
merge tree is one that has optimal merge cost. There is a simple formula for 
calculating the minimum length required for each node of a merge tree. Let 
xii^r{T) be a non-root node in a tree T. Then 

230 i{x) = 2z(x) -X- p(x) 

= (z(x)-x) + {z(x)-p(x)) 
where z(x) is the arrival time of the last stream in the subtree rooted at x and p{x) 
is a parent of x. hi particular, if x is a leaf, then z(x) = x and £(x) = x - p(x) . 
The length of stream x is composed of two components: the first component is the 

235 time needed for clients arriving at time x to receive data from stream x before they 
can merge with stream p(x); the second component is the time stream x must 
spend until the clients arriving at time ^(x) merge to p(x). Using the preorder 
traversal property of optimal merge trees, there is an elegant recursive formula for 
the merge cost of a tree T, illustrated by Fig. 7. A key property of merge trees is 

240 that for any node the subtree rooted at ti contains the interval of arrivals tt, 

ti+j,..., tj, where z(td = tj- Furthermore, tj is the right most descendant of ti. As a 
result, any merge tree can be recursively decomposed into two in a natural way as 
illustrated in FIG. 7. FIG. 7 shows the recursive structure of a merge tree J with 
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root r. The last arrival to merge directly with r is x. All the arrivals before x are 
245 in T and all the arrivals after x are in T' ' and z is the last arrival. Thus, it can be 
shown that 

Mcost(r) = Mcost(r') + Mcost(r' ') + 2z-x-r 
250 The full cost of a forest Fofs merge trees Ti,. ..,Ts, is defined as 

Fcost(F) = s-L+^ Mco&t(Ti) 

that is, the full cost of a forest is the sum of the merge costs of all its trees plus s 
255 times the length of a full transmission, one per each tree. Note that the length of 
any non-root nodes in T cannot be greater than L. Merge trees that do not violate 
this condition are referred to by the inventors as "L-trees." The optimal full cost 
for a sequence is the minimum full cost of any such forest for the sequence. An 
optimal forest is referred to as one that has optimal full cost. 
260 Defme M{i, j) to be the optimal merge cost for the input sequence 

t^,...,t^. The optimal cost for the entire sequence, thus, is M{l,n). The optimal 
cost may be computed using dynamic programming. M{i, j) can be recursively 
defined as follows 

M (/, j) = min{M(i,k - 1) + M (k, j) + (2t- - - f, ) } 

i<k< j 

265 with the initialization M(i,i) = 0 . This recursive formulation naturally leads to an 
0(n^) time algorithm using dynamic programming, as is well understood in the 
art. The time to compute the optimal merge cost may be reduced to 0(n^) be 
employing the classic technique of monotonicity. See, e.g., D. E. Knuth, 
"Optimum Binary Search Trees," Acta hiformatica. Vol. 1, 14-25 (1971). Define 

270 r(i,i) = i and, for i< j, as follows: 

r(i, j) = max{k : M(i, j) = M{i,k - 1) + M{k, j) + 2t^ -t^^-t^} 

9 
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Thus, r(i, j) is the last arrival that can merge to the root in some optimal merge 
tree for . Monotonicity is the property that for 1 < i < n and l< j<n 

r(ij-l)<rii,j)<r(i + l,j) . 

275 It should be noted that there is nothing special about using the max in the 

definition of r{i, j) ; the min would yield the same inequality. Monotonicity can 
be demonstrated using a very elegant method of quadrangle inequalities. See F. F. 
Yao, "Efficient Dynamic Programming Using Quadrangle Inequalities," 
Proceedings of the 12* Annual ACM Symposium on Theory of Computing 

280 (STOC '80), 429-35 (1980); A. Borchers and P. Gupta, "Extending the 

Quadrangle Inequality to Speed Up Dynamic Programming," Information 
Processing Letters, Vol. 49, 287-90 (1994). Thus, the search for the k in the 
above recursive formulation can be reduced to r{i + 1, j) ~ r(i, + \ 
possibilities from j-l possibilities. The key point is that the right most stream 

285 that merges to the root of an optimal tree from i to i + j is confined to an interval 
and these intervals are almost disjoint for i not equal to j. 

An optimal algorithm for calculating the full cost uses the optimal 
algorithm for merge cost above as a subroutine. Let ?i,/2, be a sequence of 
arrivals, and let L be the length of a full stream. For 1 < f < « , define G(i) to be 

290 the optimal full cost for the last n-i + l arrivals Define G(n + 1) = 0 and 

f or 1 < / < n , 

G(i)^L + min{M(i,k-l) + G(k}:i<k<n + l and -f, <L-1} . 
The optimal full cost is G(l) and the order of computation is 
G(n + V),G(n),...,G(l) . The optimal full cost algorithm proceeds in two phases. 
295 In the first phase, the optimal merge cost M(i, j) is computed for all i and j such 
that 0<tj-t, < L - 1 , so that these values can be used to compute G(i) . In the 
second phase, G{i) is computed from i - n down to 1 using the above equation. 
The intuition for the above is as follows: a full stream must begin at ?i , and there 
are two possible cases in an optimal solution. Either all the remaining streams 
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300 merge to this first stream or there is a next full stream for some k<n. In the 
former case, the optimal full cost is simply L + M{\,n). In the latter case, the 
optimal full cost is L + M(l,fc -1) plus the optimal full cost of the remaining 
arrivals t^,...,t,. In both cases, the last arrival to merge to the first stream must be 
within L-1 of the first stream. The full streams can be identified inductively. 

305 Both phases of the optimal algorithm together run in time 0{nm) , where m is the 
average number of arrivals in an interval of length L-1 that begins with an 
arrival. The above algorithm is practical enough to schedule millions of reserved 
arrivals. 

An important special case which simplifies the above optimal 
3 ] 0 merge cost solution is when an arrival is scheduled at every time unit, referred to 
herein as the "fully loaded arrivals" case. The fully loaded arrivals case can be 
thought of as being a system with a guaranteed maximum delay, where streams 
are scheduled at every time unit regardless of client arrivals. For the case of fully 
loaded arrivals, the value M{i,j) does not depend on i{ti) and;(?;) but rather 
315 depends on their difference j - i. Hence, where M(n) is the minimum cost for a 
merge tree for the arrivals [0, n - 1], the following recursive formula for the 
merge cost for fully loaded arrivals is obtained. 

M{n) = ndn^{M{h) + M{n-h) + 2n-h-2} 

with the initialization M(l) = 0. Using the notation above, the term 2n-h-2 
320 comes from z = n- l, x = h, and r = 0 and then 2z-x-r = 2n-h-2. 
Calculating M(n) for small values of n yields an interesting sequence: 



n 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


.6 


M{n) 


0 


1 


3 


6 


9 


13 


17 


21 


26 


31 


36 


41 


46 


52 


58 


64 



A careful examination of this sequence reveals that there is a very elegant 
325 formulation of the merge cost in terms of Fibonacci numbers: 

M(n) = (fe-l)f7-F,^2+2 if F,<n<F,^, 
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where Fk is the Fibonacci number. As is well known in the art, the Fibonacci 
numbers are defined by the following recurrence: F^. = F^_^ + Ff,_2 for k>2, 
where Fo = 0 and Fi = 1 . It can be shown that for n equal to a Fibonacci number 

330 there is a unique optimal tree, which the inventors refer to as a "Fibonacci merge 
tree." FIG. 8A through 8D illustrate such optimal trees for n = 3, 5, 8, 13, with 
corresponding merge costs of M(n) = 3, 9, 21, 46, respectively. Note the structure 
of these optimal trees: the right-most subtree of the tree for n = Fkis the tree for n 
= Fk-2 whereas the rest of the tree to the left is a tree iovn = Fk-\. On the other 

335 hand, for other values of n there can be multiple optimal trees, e.g., FIG. BE and 
FIG. 8F illustrate two optimal trees for four arrivals, both trees having a merge 
cost of six. It is of interest to see which arrivals can be the last to merge in an 
optimal merge tree. Define the following two auxiliary functions: 
H{n,h)-=M{h) + M{n-h) + 2n-h-2 
I(n) = {h:M(n) = H(n,h)} 

340 so that the value of M(n) can be determined by minimizing H(n, /z) for \<h<n- 
1. The members of 7(n) are all the arrivals that can be the last merge to the root in 
an optimal merge tree for [0, n - 1]. FIG. 9 shows the values of I{n) for 2<n< 
34. Each set /(n) is an interval and the pattern depends heavily on Fibonacci 
numbers. The following definitions are useful in characterizing these intervals. 

345 For a given n,n = Fk + m for some 0 < m < F^t _ i, define the following three 
intervals: 

I,(n) = [F,_„F,_,+m] 
l2in)^[F^_2+m,F^_,+m] 
I,(n) = [F,_,+m,F,] 

A given interval Ii(n) will be the I(n) for a certain range of m in the interval 
[0, Fjt-i]. Define those ranges as: 

m,(k) = [0,F,_,] 
350 m^ik) = [F,_„F,_^] 

m,(k) = [F,_„F,_,] 

Then it can be shown by induction that if m g m.(k) , 
12 
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M(n) = (^-l)n-F^+2+2 and /(n) = /,.(«) 
which can be used as the basis of an efficient algorithm to construct an optimal 
merge tree for fully loaded arrivals. 
355 An optimal merge tree for fully loaded arrivals can thus be 

computed in time 0(n) using the above closed form solution. Let [0, n - 1] be an 
input. Define r(i)^maxl(i) for l<f<n. So r(0 is an arrival that can be the 
last merge in an optimal merge tree for the input [0,i-l]. An optimal merge tree 
for the input [ij] can be computed using the following recursive procedure. If i = 
360 j return the tree with label i. Otherwise, recursively compute the merge tree Ti for 
the input [i, i + r( j - / + 1) - 1] and T2 for [i + r{j - i + 1), 7] , then attach the root of 
T2 as an additional last child of the root of Ti and return the resulting tree. This 
procedure is called for the input [0, n - 1] to construct an optimal merge tree. 
With an elementary data structure the tree can be constructed in linear time 
365 provided that r{i) has already been computed for 1 < / < ?j . The Fibonacci 

numbers < n, can be computed in 0(log n) time. The sequence r(l), r(2), .. ., rin) 
can be computed in linear time using the recurrence 

r(0 = ra-l) + l if F,<i<F,+F,_, 
= r(i-l) if F,+F,_^<i<F,,, 
370 with the initialization r(l) = 0 and r(2) = 1 . An optimal forest for fully loaded 
streams can be constructed in linear time. In computing the full cost of a merge 
forest, the cost of the roots must be taken into account. There are basically two 
steps: first, determine how many full streams are in an optimal merge forest and, 
second, where to place the full streams. Define F{L, n, s) to be the minimum cost 
375 of any merge forest for [0, n - 1] where the length of a full stream is L and there 
are exactly 5 roots (full streams). Since at most L-1 streams can merge with a 
stream of length L, it follows that for a given n there must be at least = / l] 
full streams for n arrivals. Hence, 

F{L,n) - min F(L,n,s) 

So<s<n 

380 Notice the extreme cases: L = 1 impHes so = n and n = L-l implies so= I. 
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For a fixed s, the placement of the full streams in an optimal merge forest with s 
full streams can be determined. Where n-ps + r and 0 < r < 5, it can be shown 
that 

F{L,n, s) = sL + rM {p + \) + {s- r)M ( p) 

385 This yields a straightforward linear time algorithm for computing an optimal 

merge forest. First, the above-described Fibonacci formulation of M can be used 
to compute M(l), M(2),. . ., M(L). Next, search for 2Xis{so<s<n) that minimizes 
sL + rM{p + \) + {s-r)M{p) where p = \nls\dinA r = n-ps. To construct 
the merge forest, place r full streams at 0, p -I- l,2{p + 1),..., (r - V){p + \) and 5 - r 

390 full streams at r{p + V},r{p + \) + p,rip + \) + 2p,.,.,r{p + \) + is-r-\)p . Use 
the linear time algorithm for constructing an optimal merge tree to complete the 
forest. The optimal merge forest for fully loaded arrivals can be computed in 0(L 
+ n) time. Note that it is possible to directly calculate the number of full streams 
needed in an optimal merge forest rather than searching for the s that minimizes 

395 the above expression. First, compute h such that F^^^ <L + 2< F^^^ ■ This h can 
be computed in linear number of log operations. Next, compute = \pl Fh\ 
and SQ=\nl L \ . If sq > si then 5o = 5i + 1 minimizes F(L, n, s). Otherwise, 
compute F{L, n, si) and F(L, n, si+l) using the above expression. If the former 
value is smaller, then si minimizes F(L, n, s), otherwise si + 1 does. It is 

400 interesting to note that there are cases where is optimal and i'l + 1 is not, si + I 
is optimal and s\ is not, and both s\ and 5i -i- 1 are optimal. It should be noted that 
a natural guess for s is \n/(\_L/2\+ 1)]. That is, a full stream is scheduled at 
intervals of length about L/2 and each tree contains about L/2 nodes. This value 
of s is not always optimal, but it is optimal in many cases and at the very least 

405 gives a good upper bound for the full cost in the fully loaded arrivals case. 

In contrast to the off-line situation in which client reservations are 
accepted in advance, we next describe the "on-line" situation in which the client 
requests are not known ahead of time. When a new client tn arrives, the media 
server 150 is assumed to have already constructed a merge forest F„_ 1 for the 
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410 preceding n - 1 clients, h, ... ,tn-\ where h is the root of the first merge tree in 
the forest. Given n > 1, a decision must be made to either incorporate into the 
last merge tree in the forest or to start a new merge tree by making tn its root. The 
goal in the on-Hne situation is to obtain results dynamically that are good relative 
to an after-the-fact off-Une computation. 

415 FIG. 10 sets forth a simpUfied flowchart of the processing 

performed by the server 150 in the on-line situation, in accordance with a 
preferred embodiment of the invention. At step 1001, the server 150 receives the 
request from the chent at time slot tn- At step 1002, the server 150 compares 4 - 
tm to the quantity L/2, where is the last root of a merge tree in the merge forest 

420 F„_ 1. If - > L/2, then, at step 1004, the server 150 starts a new merge tree 
with tn becoming the root of the new merge tree in F„. This start rule has many 
justifications. First, if tn-tm< LI2 then can always be incorporated into the 
merge tree rooted at f„, and for - t,n time slots the clients served by tn will be 
receiving two streams simultaneously. Second, there is a serious disadvantage of 

425 trying to incorporate tn into merge tree rooted at tm if tn-tm< LI2. Consider the 
extreme example where = f„ - 1 and /„ = f„-l-hL-l. In this example, f„ can 
merge directly to _ i so that ) = L - 1 . However, stream f„ only receives one 
part of tn -1, namely, its last part. Even worse, no future arrival can receive any 
part of tn because doing so would cause the length of stream tn to exceed L. The 

430 only potential gain in merging f„ to f„_ i is if there are no arrivals in the next L 
slots after tn. 

Assuming no new merge tree will be created, the server 150 must 
then decide how to incorporate the new arrival into the existing merge tree at step 
1003. A new merge tree r„ will then be created which incorporates the arrival tn 

435 into the existing merge tree, referred to as r„ _i for arrivals ti, ... ,tn-i. In order to 
preserve the preorder labeling (and thereby the existing stream merging rules), the 
new arrival should be merged into the "right frontier" of the existing merge tree. 
The right frontier of r„ _ i is the path t\=XQ,xi, ...,Xk = tn-\ where xi+i is the right 
most child of jc, for Q<i<k. For example, consider the case of a new arrival at 

440 time slot 15 in FIG. 3. As reflected on the corresponding merge tree in FIG. 6, the 
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right frontier of the merge tree comprises the nodes 601-604. The pre-order 
traversal property of the tree requires that some node on the right frontier be made 
the parent of the new arrival at slot 15. Note, however, that not every node on the 
right frontier is eligible to be a parent of the new arrival. The arrival at time slot 

445 15 cannot merge with node 604 because the stream xs has already terminated. 
Thus, the arrival can be merged into the root at 601 (stream xq) or into the 
remaining nodes 602 (stream xi) or 603 (stream X2). This basic merging rule can 
be expressed more formally as requiring that r„ = T°_^ or r„ = r;_j for some i > 0 
such that < 2f„_i - X;_i , where rj_i is defined to be the tree T„_i with chosen 

450 as the parent of r„, that is, p^,(tj = x^. This transition from T„_^ to r;_j is 

further illustrated abstractly by FIG. 11. Note that the new right frontier of r„'_i is 

ti =Xo, Xl, ...,Xj, tn. 

The incremental cost of merging tn into r„_ 1 can be expressed as: 

455 MCost(T „) - MCost(T^_, ) = 2r (?„ - t„_, ) + t„-x^ 

where the last part of the cost, ?„ - , is the length of tn and the first part of the 
cost represents the length of each non-root ancestor of 4 due to the change of its 
last descendant from _ 1 to f„. A simple approach to optimizing on-line 

460 streaming would be to choose a parent so as to minimize the incremental merge 
cost, which the inventors refer to as a "best fit" rule. Another approach would be 
to pick a parent which is "closest" in some sense to the new arrival, which the 
inventors refer to as the "nearest fit" rule. For example, the largest i could be 
chosen where the i-th parent has not yet terminated. Unfortunately, it can be 

465 shown that these approaches do not have good performance relative to off-line 
stream merging. 

Instead, and in accordance with another aspect of the invention, it 
can be shown that it is advantageous to force the on-line algorithm to "follow" an 
on-line merge tree as closely as possible. The on-line tree acts as a kind of 
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"governor" in a tree-fit algorithm where each new arrival must merge with a 
member of the right frontier of the on-line merge tree. First, consider the situation 
of a fixed merging pattern, where the sequence of arrivals is not known in 
advance, but its length is assumed to be n. A fixed unlabeled tree T with n nodes, 
referred to by the inventors as a "static" merge tree, is utilized in an "oblivious" 
off-line merging process, which can be considered a "semi" on-line algorithm. 
Given an arrival sequence r = (/i,...,?J , a merge tree is constructed with the 
same structure as T, but with the labels fi, ...,?„ put on the nodes in a preorder 
fashion. Hence, given two arrival sequences t ^T:\\t could be the case that 
Mcost(r(T) ) Mcost(r(T') ). How well a static merge tree T performs can be 
expressed as an approximation ratio ut defined as follows: 



= supj ^^'^'^^^"^^^^'^ : T is an arrival sequence of length n } 



This quantity measures the worst case performance of the static tree T as 
compared with optimal. It turns out that the approximation ratio of a static merge 
tree can be exactiy characterized by measuring what the inventors refer to as its 
"extent." For a static merge tree T and a node x in T, define ui{x) to be the 
number of ancestors of x not counting x and the root of T, and define vr(jc) to be 
the number of right siblings of ancestors of x (including x). The extent of x is 
defined to be: 

Cj (x) = 2uj {x) + Vj (x) + 1 
while the extent of the static merge tree T is: 

e{T) = max{ey(x) : x in T} 
For any static merge tree T, it can be shown that aj = e(r) . The extent can be 
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shown to be a lower bound of the approximation ratio by example. The extent can 
500 also be shown to be an upper bound by induction on the number of nodes in T 

using a transformation of T to an optimal 7*. The goal of the transformation is to 
make x, a node in T which is not the root, the last child of the root of 7*. The tree 
T is formed from two trees T[ and as follows. First, is the subtree of T 
consisting of all nodes that come before x in a preorder traversal of T. What 
505 remains from T after is a sequence of disconnected merge trees Xq, Xi, . . . , Xk 
where Xo is the subtree of T rooted at x and X+i is the subtree of T that is traversed 
in a preorder traversal of T immediately after X; is traversed. The merge tree 
is formed by taking Xo whose root is x and making x the parent of the root of each 
X for \ <i<k in that order. The transformation from Tto 7* is illustrated in FIG. 
510 12. Note that is the subtree of 7* of all arrivals before x and r/ is the subtree 
of T of all arrivals after and including x. This corresponds to the subtrees T and 
T' ', respectively shown in FIG. 7. As a byproduct of the construction the subtree 
to the left of the last merge and the subtree rooted at the last merge each have 
extent less or equal to the extent of T. If the costs of the move are carefully 
5 15 examined, it can be shown that the cost is bounded by e{T) - 1 times the cost of x 
in 7*. The extent, and accordingly the approximation ratio for any static merge 
tree, can be shown to have a lower bound of log^ n-\, where 

+ « 1.618 is the golden ratio and is the positive root of the equation 

=x + l. Furthermore, it is advantageous to note that the extent of a Fibonacci 
520 tree is essentially the same as the lower bound. 

Given the knowledge of the approximation ratio for static trees, a 
new class of dynamic tree algorithms can be defined. Define a "preorder tree" to 
be an infinite tree in which the root has an infinite number of finite size subtrees 
as its children. Such a tree has the property that the preorder traversal provides a 
525 numbering for the entire tree starting with the root numbered 1 . Define T{n\ to be 
the finite subtree of Tof the nodes numbered 1 to n. An "infinite merge tree" is a 
preorder tree labeled with the arrival sequence tuh,... in preorder fashion. An 
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advantageous example of a preorder tree what the inventors refer to as the infinite 
Fibonacci tree 0^. Define the finite Fibonacci trees FTi, FT2, ... as follows. The 

530 trees FTi and FT2 are each signal nodes. The tree FTk is formed from FTk- 1 and 
FTk - 2 by making the root of FTk - 2 the last child of the root of FTj, - 1 . It should be 
clear from the construction of Fr^:that its size is Ft. Furthermore, it can be shown 
by induction that the extent e{FTk ) = k-2fork>2. Define Jas a root with 
infinitely many children where the kth child is the root of the subtree FTk. A 

535 preorder traversal of !F defines the preorder numbering of the nodes with the root 
numbered 1. Define y^[n] to be the subtree of ^ consisting of the n nodes 
numbered from 1 to n. Then, it can be shown for /:> 2, ^[Fk] = FTk. The infinite 
Fibonacci tree yields static trees with almost minimal approximation ratio. For n 
> 1, it can be shown that a j^^n] < |log^ n ]. Thus, the approximation ratio of iF 

540 [n] is within 1 of the lower bound for all static trees of size n. If n is a Fibonacci 
number then !F[n] has the minimum approximation ratio for a static tree of its 
size. 

In accordance with an embodiment of an aspect of the invention, a 
dynamic tree algorithm proceeds by producing a new infinite merge tree for each 

545 new arrival. Suppose that r„_i is the infinite merge tree after processing the 
arrivals tu--.,t2,- Let ?i = yo, 3^1 , - . . , + 1 = ?n be the path from the root to f„ in 
r„_j . For each i < A; , we define r„'_i which is formed from r„_j as follows. Let 
C,={x: p{x) = and ;c> } . So x e C, if x is a child of y^ arriving later than 
tri. Define rj_j to be the tree T^_^ modified so that p^, = y, and p^, ^ (x) = 

550 for ally > i and x e C- . See FIG. 13 for an illustration of the transformation T'„_i 
to r^^i . The dynamic tree algorithm for T satisfies the following formal rule: 
either r„ = T^_, or r„ = Tl_, for some i such that 0<i<k and < 2f„_, - y._^ . 
This is a special case of the basic merging rule described above. The path yo, 
)^ is a prefix of the right frontier, which is the path from yo to fn- 1- It should be 

555 noted that although r„_i is fully labeled with arrivals (suggesting that it is 
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necessary to know the future arrivals and maintain an infinite tree), it can be 
assumed for implementation purposes that it is only labeled with the known 
arrivals ^l, . . . , fn - 1 • The algorithm knows the structure of the tree so that 
when tn becomes known it is made the label of the nth node in the tree. It can be 

560 seen inductively that r„ is an infinite preorder tree if r„_i is an infinite preorder 
tree. The tree T is, by definition, composed of infinitely many finite trees Ti, Ti, 
. . ., whose root is a child of the root of T. The tree T^_^ is numbered before the tree 
T. in a preorder traversal of T. Let n, be 1 plus the sum of the sizes of for; > 
i. As long as n<n^, only that part of T that includes the first i trees need be 

565 maintained. When n = n. + 1, the next tree T-^^ can be incorporated into the 
algorithm. This can be done because if /? < , the transition from to T„'_j 
leaves fixed all the trees Tj for j > i. 

It would be advantageous that the new tree, in the transition from 
to rj_i , be just as effective as the old tree for future arrivals in order for the 

570 dynamic tree algorithm to behave well. This turns out to be true if the algorithm 
is "cost-preserving," meaning that the new tree r„ = r„'_j for some 0<i<k such 
that y, -y^+ 2(k - i)it„ - ) > 0 . It can be shown that if this condition is true, 
then Mcost( T^_^ [m] ) > Mcost( rj_i [m] ) for all m>n. It can then be shown that if 
A is a dynamic tree algorithm for T that satisfies the cost preserving rule, then for 

575 all n, (n) < aj^^^ . Thus, the competitive ratio performance of the dynamic tree 
algorithm can be related with the approximation ratio of the static trees. Two 
classes of algorithms can be easily shown to satisfy the cost preserving rule, and 
therefore are bounded above by the approximation ratios of the prefixes of T, 
aj,[„3 : the "best fit" dynamic tree algorithm and the "nearest fit" dynamic tree 

580 algorithm for T. The "best fit" algorithm satisfies the rule that = T;'_j for an i 
that minimizes A,. „ = 2j'(?„-f„_i ) + - y, • The "nearest fit" algorithm satisfies 
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the rule that r„ = T^_^ if f„> 2/„_i - y^_^ for all 0 < / < A: and r„ = T„_^ if i is the 
largest number such that t^< 2t^_^ - . 

Since the infinite Fibonacci tree F has the best approximation 

585 ratios in the static situation, it makes sense to use it in a dynamic tree algorithm. 
Where the best fit dynamic tree algorithm uses an infinite Fibonacci tree (referred 
to by the inventors as a "best fit dynamic Fibonacci tree (BFDT) algorithm") and 
where the nearest fit dynamic tree algorithm uses an infinite Fibonacci tree 
(referred to by the inventors as a "nearest fit Fibonacci tree (NFDT) algorithm"), 

590 it can be shown that the merge cost competitive ratios are bounded by | log^ n \ . 
For the case in which there is an arrival every time slot, both algorithms have a 
constant competitive ratio. They are optimal when « is a Fibonacci number, since 
the Fibonacci tree is the optimal merge tree for such sequences, as discussed 
above. This is not the case with other values of n = + m , for > 3 and 

595 0<m< F^_^ . In this case, the optimal tree divides the arrivals into the left and the 
right subtrees according to the golden ratio. On the other hand, the size of the left 
subtree is always Ft- Nevertheless, the loss is not too big, and the competitive 
ratio is constant. Moreover, it can be shown that there is a bound on the full cost 
competitive ratios of the algorithms. Expressed using the parameter D = 1/L 

600 (which can be interpreted as the guaranteed maximum startup delay measured as a 
percentage of the stream length), the full cost competitive ratios of the best and 
nearest fit dynamic Fibonacci tree algorithms are bounded above by: 

min{| log^ (1 l{2D)) |+ 2, | log^ (n) |+ 2} 

605 

When D is very small the competitive ratio in the full cost is 0(log n) as is the 
competitive ratio for the merge cost. In the extreme, when D tends to zero, this 
models situations in which arrivals could happen at any time. However, it is very 
realistic to assume that n is very large and D is a constant. That is, clients tolerate 
610 some delay and the time horizon is long. In this case, the above equation yields a 
constant competitive ratio bound. As an example, suppose there is a two hour 
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video with a guaranteed maximum delay of 4 minutes. Then L = 30 and D = 1/30 
or about 3.33%. The best fit and nearest fit dynamic Fibonacci tree algorithms 
have competitive ratios bounded above, according to the above equation, by 8. 

615 Hence, it is known that these algorithms will never use more than 8 times the 

bandwidth required by an optimal off-line solution — and in common case arrivals 
will perform even better. 

The foregoing Detailed Description is to be understood as being in 
every respect illustrative and exemplary, but not restrictive, and the scope of the 

620 invention disclosed herein is not to be determined from the Detailed Description, 
but rather from the claims as interpreted according to the full breadth permitted by 
the patent laws. It is to be understood that the embodiments shown and described 
herein are only illustrative of the principles of the present invention and that 
various modifications may be implemented by those skilled in the art without 

625 departing from the scope and spirit of the invention. For example, many of the 
examples and equations have been presented in the context of a model in which 
the client receives data from two multicast channels. One of ordinary skill in the 
art can readily extend the various aspects of the above invention to clients that 
receive data from more than two multicast channels. 
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